Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 149956 |
| Missing cells | 1929140 |
| Missing cells (%) | 67.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 22.9 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 4 |
| Text | 2 |
| DateTime | 1 |
footnote has constant value "" | Constant |
data_type is highly imbalanced (50.1%) | Imbalance |
derivation_id is highly imbalanced (89.3%) | Imbalance |
unit_name is highly imbalanced (51.9%) | Imbalance |
data_type has 114578 (76.4%) missing values | Missing |
description has 114586 (76.4%) missing values | Missing |
food_category_id has 114600 (76.4%) missing values | Missing |
publication_date has 114578 (76.4%) missing values | Missing |
id has 35378 (23.6%) missing values | Missing |
nutrient_id has 35851 (23.9%) missing values | Missing |
amount has 35851 (23.9%) missing values | Missing |
data_points has 38324 (25.6%) missing values | Missing |
derivation_id has 35860 (23.9%) missing values | Missing |
min has 139150 (92.8%) missing values | Missing |
max has 139150 (92.8%) missing values | Missing |
median has 138548 (92.4%) missing values | Missing |
footnote has 149953 (> 99.9%) missing values | Missing |
min_year_acqured has 124305 (82.9%) missing values | Missing |
name has 149483 (99.7%) missing values | Missing |
unit_name has 149483 (99.7%) missing values | Missing |
nutrient_nbr has 149495 (99.7%) missing values | Missing |
rank has 149494 (99.7%) missing values | Missing |
amount is highly skewed (γ1 = 45.27538282) | Skewed |
min is highly skewed (γ1 = 57.3891599) | Skewed |
max is highly skewed (γ1 = 44.33747669) | Skewed |
median is highly skewed (γ1 = 52.0960738) | Skewed |
amount has 25966 (17.3%) zeros | Zeros |
min has 3430 (2.3%) zeros | Zeros |
max has 2422 (1.6%) zeros | Zeros |
median has 2967 (2.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-06-21 23:05:45.346755 |
|---|---|
| Analysis finished | 2023-06-21 23:06:02.552043 |
| Duration | 17.21 seconds |
| Software version | ydata-profiling vv4.3.1 |
| Download configuration | config.json |
fdc_id
Real number (ℝ)
| Distinct | 35378 |
|---|---|
| Distinct (%) | 23.7% |
| Missing | 473 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 753370.81 |
| Minimum | 319874 |
|---|---|
| Maximum | 2007412 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 319874 |
|---|---|
| 5-th percentile | 322258.1 |
| Q1 | 328451 |
| median | 335473 |
| Q3 | 790379 |
| 95-th percentile | 2003599 |
| Maximum | 2007412 |
| Range | 1687538 |
| Interquartile range (IQR) | 461928 |
Descriptive statistics
| Standard deviation | 612415.95 |
|---|---|
| Coefficient of variation (CV) | 0.81290108 |
| Kurtosis | -0.2804282 |
| Mean | 753370.81 |
| Median Absolute Deviation (MAD) | 12582 |
| Skewness | 1.1659583 |
| Sum | 1.1261613 × 1011 |
| Variance | 3.750533 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1999630 | 160 | 0.1% |
| 322228 | 158 | 0.1% |
| 746782 | 158 | 0.1% |
| 1750337 | 158 | 0.1% |
| 322559 | 158 | 0.1% |
| 746778 | 158 | 0.1% |
| 321359 | 158 | 0.1% |
| 746776 | 158 | 0.1% |
| 322892 | 158 | 0.1% |
| 746772 | 158 | 0.1% |
| Other values (35368) | 147901 | |
| (Missing) | 473 | 0.3% |
| Value | Count | Frequency (%) |
| 319874 | 1 | < 0.1% |
| 319875 | 1 | < 0.1% |
| 319876 | 1 | < 0.1% |
| 319877 | 5 | |
| 319878 | 10 | |
| 319879 | 1 | < 0.1% |
| 319880 | 1 | < 0.1% |
| 319881 | 1 | < 0.1% |
| 319882 | 5 | |
| 319883 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2007412 | 2 | |
| 2007411 | 2 | |
| 2007410 | 2 | |
| 2007409 | 2 | |
| 2007408 | 2 | |
| 2007407 | 2 | |
| 2007406 | 2 | |
| 2007405 | 2 | |
| 2007404 | 2 | |
| 2007403 | 2 |
data_type
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 114578 |
| Missing (%) | 76.4% |
| Memory size | 2.3 MiB |
| sub_sample_food | |
|---|---|
| market_acquisition | |
| sample_food | 2208 |
| agricultural_acquisition | 810 |
| foundation_food | 223 |
Length
| Max length | 24 |
|---|---|
| Median length | 15 |
| Mean length | 15.440274 |
| Min length | 11 |
Characters and Unicode
| Total characters | 546246 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | sample_food |
|---|---|
| 2nd row | market_acquisition |
| 3rd row | market_acquisition |
| 4th row | sub_sample_food |
| 5th row | sub_sample_food |
Common Values
| Value | Count | Frequency (%) |
| sub_sample_food | 26431 | 17.6% |
| market_acquisition | 5706 | 3.8% |
| sample_food | 2208 | 1.5% |
| agricultural_acquisition | 810 | 0.5% |
| foundation_food | 223 | 0.1% |
| (Missing) | 114578 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sub_sample_food | 26431 | |
| market_acquisition | 5706 | 16.1% |
| sample_food | 2208 | 6.2% |
| agricultural_acquisition | 810 | 2.3% |
| foundation_food | 223 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 64686 | |
| _ | 61809 | |
| s | 61586 | |
| a | 42704 | 7.8% |
| u | 34790 | 6.4% |
| m | 34345 | 6.3% |
| e | 34345 | 6.3% |
| l | 30259 | 5.5% |
| d | 29085 | 5.3% |
| f | 29085 | 5.3% |
| Other values (10) | 123552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 484437 | |
| Connector Punctuation | 61809 | 11.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 64686 | |
| s | 61586 | |
| a | 42704 | |
| u | 34790 | 7.2% |
| m | 34345 | 7.1% |
| e | 34345 | 7.1% |
| l | 30259 | 6.2% |
| d | 29085 | 6.0% |
| f | 29085 | 6.0% |
| p | 28639 | 5.9% |
| Other values (9) | 94913 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 61809 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 484437 | |
| Common | 61809 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 64686 | |
| s | 61586 | |
| a | 42704 | |
| u | 34790 | 7.2% |
| m | 34345 | 7.1% |
| e | 34345 | 7.1% |
| l | 30259 | 6.2% |
| d | 29085 | 6.0% |
| f | 29085 | 6.0% |
| p | 28639 | 5.9% |
| Other values (9) | 94913 |
Common
| Value | Count | Frequency (%) |
| _ | 61809 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 546246 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 64686 | |
| _ | 61809 | |
| s | 61586 | |
| a | 42704 | 7.8% |
| u | 34790 | 6.4% |
| m | 34345 | 6.3% |
| e | 34345 | 6.3% |
| l | 30259 | 5.5% |
| d | 29085 | 5.3% |
| f | 29085 | 5.3% |
| Other values (10) | 123552 |
description
Text
MISSING 
| Distinct | 11406 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 114586 |
| Missing (%) | 76.4% |
| Memory size | 2.3 MiB |
Length
| Max length | 128 |
|---|---|
| Median length | 110 |
| Mean length | 39.958128 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1413319 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 10969 ? |
|---|---|
| Unique (%) | 31.0% |
Sample
| 1st row | HUMMUS, SABRA CLASSIC |
|---|---|
| 2nd row | HUMMUS, SABRA CLASSIC |
| 3rd row | HUMMUS, SABRA CLASSIC |
| 4th row | Hummus |
| 5th row | Hummus |
| Value | Count | Frequency (%) |
| 9408 | 4.6% | |
| milk | 5613 | 2.7% |
| shelf | 5427 | 2.6% |
| stable | 5427 | 2.6% |
| plain | 4470 | 2.2% |
| unsweetened | 4135 | 2.0% |
| oil | 2940 | 1.4% |
| mushrooms | 2881 | 1.4% |
| raw | 2617 | 1.3% |
| cheese | 2586 | 1.3% |
| Other values (10838) | 160297 |
Most occurring characters
| Value | Count | Frequency (%) |
| 172292 | 12.2% | |
| , | 86851 | 6.1% |
| E | 70733 | 5.0% |
| e | 53339 | 3.8% |
| A | 50771 | 3.6% |
| N | 50141 | 3.5% |
| S | 44362 | 3.1% |
| L | 43277 | 3.1% |
| I | 41190 | 2.9% |
| T | 39622 | 2.8% |
| Other values (68) | 760741 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 647236 | |
| Lowercase Letter | 407791 | |
| Space Separator | 172292 | 12.2% |
| Other Punctuation | 93028 | 6.6% |
| Decimal Number | 56891 | 4.0% |
| Dash Punctuation | 15210 | 1.1% |
| Close Punctuation | 10408 | 0.7% |
| Open Punctuation | 10408 | 0.7% |
| Connector Punctuation | 54 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 53339 | |
| a | 37245 | 9.1% |
| r | 33301 | 8.2% |
| s | 31766 | 7.8% |
| n | 30531 | 7.5% |
| o | 30025 | 7.4% |
| i | 29732 | 7.3% |
| t | 24995 | 6.1% |
| l | 17746 | 4.4% |
| d | 17417 | 4.3% |
| Other values (17) | 101694 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 70733 | 10.9% |
| A | 50771 | 7.8% |
| N | 50141 | 7.7% |
| S | 44362 | 6.9% |
| L | 43277 | 6.7% |
| I | 41190 | 6.4% |
| T | 39622 | 6.1% |
| O | 36779 | 5.7% |
| R | 33444 | 5.2% |
| F | 26332 | 4.1% |
| Other values (16) | 210585 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16360 | |
| 1 | 15081 | |
| 2 | 9114 | |
| 9 | 4087 | 7.2% |
| 4 | 3033 | 5.3% |
| 3 | 2644 | 4.6% |
| 6 | 2011 | 3.5% |
| 7 | 1652 | 2.9% |
| 8 | 1488 | 2.6% |
| 5 | 1421 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 86851 | |
| % | 2622 | 2.8% |
| / | 2103 | 2.3% |
| & | 745 | 0.8% |
| ; | 374 | 0.4% |
| . | 189 | 0.2% |
| " | 60 | 0.1% |
| # | 43 | < 0.1% |
| ' | 41 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 172292 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15210 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10408 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10408 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 54 |
Control
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1055027 | |
| Common | 358292 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 70733 | 6.7% |
| e | 53339 | 5.1% |
| A | 50771 | 4.8% |
| N | 50141 | 4.8% |
| S | 44362 | 4.2% |
| L | 43277 | 4.1% |
| I | 41190 | 3.9% |
| T | 39622 | 3.8% |
| a | 37245 | 3.5% |
| O | 36779 | 3.5% |
| Other values (43) | 587568 |
Common
| Value | Count | Frequency (%) |
| 172292 | ||
| , | 86851 | |
| 0 | 16360 | 4.6% |
| - | 15210 | 4.2% |
| 1 | 15081 | 4.2% |
| ) | 10408 | 2.9% |
| ( | 10408 | 2.9% |
| 2 | 9114 | 2.5% |
| 9 | 4087 | 1.1% |
| 4 | 3033 | 0.8% |
| Other values (15) | 15448 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1413318 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 172292 | 12.2% | |
| , | 86851 | 6.1% |
| E | 70733 | 5.0% |
| e | 53339 | 3.8% |
| A | 50771 | 3.6% |
| N | 50141 | 3.5% |
| S | 44362 | 3.1% |
| L | 43277 | 3.1% |
| I | 41190 | 2.9% |
| T | 39622 | 2.8% |
| Other values (67) | 760740 |
None
| Value | Count | Frequency (%) |
| é | 1 |
food_category_id
Real number (ℝ)
MISSING 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 114600 |
| Missing (%) | 76.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1607365 |
| Minimum | 1 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 9 |
| Q3 | 13 |
| 95-th percentile | 19 |
| Maximum | 25 |
| Range | 24 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.6993886 |
|---|---|
| Coefficient of variation (CV) | 0.62215397 |
| Kurtosis | -0.50145529 |
| Mean | 9.1607365 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.20859651 |
| Sum | 323887 |
| Variance | 32.48303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 6856 | 4.6% |
| 1 | 6406 | 4.3% |
| 9 | 5910 | 3.9% |
| 16 | 3603 | 2.4% |
| 4 | 2924 | 1.9% |
| 14 | 1860 | 1.2% |
| 5 | 1503 | 1.0% |
| 20 | 1282 | 0.9% |
| 15 | 913 | 0.6% |
| 7 | 795 | 0.5% |
| Other values (8) | 3304 | 2.2% |
| (Missing) | 114600 |
| Value | Count | Frequency (%) |
| 1 | 6406 | |
| 2 | 386 | 0.3% |
| 4 | 2924 | |
| 5 | 1503 | 1.0% |
| 6 | 568 | 0.4% |
| 7 | 795 | 0.5% |
| 9 | 5910 | |
| 10 | 613 | 0.4% |
| 11 | 6856 | |
| 12 | 267 | 0.2% |
| Value | Count | Frequency (%) |
| 25 | 474 | 0.3% |
| 20 | 1282 | 0.9% |
| 19 | 54 | < 0.1% |
| 18 | 488 | 0.3% |
| 16 | 3603 | |
| 15 | 913 | 0.6% |
| 14 | 1860 | 1.2% |
| 13 | 454 | 0.3% |
| 12 | 267 | 0.2% |
| 11 | 6856 |
publication_date
Date
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 114578 |
| Missing (%) | 76.4% |
| Memory size | 2.3 MiB |
| Minimum | 2019-04-01 00:00:00 |
|---|---|
| Maximum | 2021-10-28 00:00:00 |
id
Real number (ℝ)
MISSING 
| Distinct | 114578 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 35378 |
| Missing (%) | 23.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6787445.7 |
| Minimum | 1001 |
|---|---|
| Maximum | 26845143 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 2222558.9 |
| Q1 | 2245552.2 |
| median | 2274245.5 |
| Q3 | 8529199.8 |
| 95-th percentile | 24468563 |
| Maximum | 26845143 |
| Range | 26844142 |
| Interquartile range (IQR) | 6283647.5 |
Descriptive statistics
| Standard deviation | 7174508.5 |
|---|---|
| Coefficient of variation (CV) | 1.0570263 |
| Kurtosis | 0.89886655 |
| Mean | 6787445.7 |
| Median Absolute Deviation (MAD) | 43533 |
| Skewness | 1.4983761 |
| Sum | 7.7769195 × 1011 |
| Variance | 5.1473573 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8519641 | 1 | < 0.1% |
| 8519689 | 1 | < 0.1% |
| 8519697 | 1 | < 0.1% |
| 8519698 | 1 | < 0.1% |
| 8519644 | 1 | < 0.1% |
| 8519645 | 1 | < 0.1% |
| 8519643 | 1 | < 0.1% |
| 8519642 | 1 | < 0.1% |
| 8519640 | 1 | < 0.1% |
| 8519637 | 1 | < 0.1% |
| Other values (114568) | 114568 | |
| (Missing) | 35378 | 23.6% |
| Value | Count | Frequency (%) |
| 1001 | 1 | |
| 1002 | 1 | |
| 1003 | 1 | |
| 1004 | 1 | |
| 1005 | 1 | |
| 1006 | 1 | |
| 1007 | 1 | |
| 1008 | 1 | |
| 1009 | 1 | |
| 1010 | 1 |
| Value | Count | Frequency (%) |
| 26845143 | 1 | |
| 24474339 | 1 | |
| 24474338 | 1 | |
| 24474337 | 1 | |
| 24474336 | 1 | |
| 24474335 | 1 | |
| 24474334 | 1 | |
| 24474332 | 1 | |
| 24474331 | 1 | |
| 24474330 | 1 |
nutrient_id
Real number (ℝ)
MISSING 
| Distinct | 221 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 35851 |
| Missing (%) | 23.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1219.1721 |
| Minimum | 1002 |
|---|---|
| Maximum | 2063 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1002 |
|---|---|
| 5-th percentile | 1004 |
| Q1 | 1091 |
| median | 1162 |
| Q3 | 1289 |
| 95-th percentile | 2006 |
| Maximum | 2063 |
| Range | 1061 |
| Interquartile range (IQR) | 198 |
Descriptive statistics
| Standard deviation | 228.66961 |
|---|---|
| Coefficient of variation (CV) | 0.18756139 |
| Kurtosis | 5.9387737 |
| Mean | 1219.1721 |
| Median Absolute Deviation (MAD) | 101 |
| Skewness | 2.3880506 |
| Sum | 1.3911363 × 108 |
| Variance | 52289.79 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1051 | 3028 | 2.0% |
| 1004 | 2998 | 2.0% |
| 1098 | 2737 | 1.8% |
| 1090 | 2737 | 1.8% |
| 1091 | 2736 | 1.8% |
| 1101 | 2736 | 1.8% |
| 1087 | 2735 | 1.8% |
| 1095 | 2735 | 1.8% |
| 1089 | 2731 | 1.8% |
| 1092 | 2724 | 1.8% |
| Other values (211) | 86208 | |
| (Missing) | 35851 |
| Value | Count | Frequency (%) |
| 1002 | 1915 | |
| 1003 | 1188 | 0.8% |
| 1004 | 2998 | |
| 1005 | 177 | 0.1% |
| 1007 | 2128 | |
| 1008 | 135 | 0.1% |
| 1009 | 1042 | 0.7% |
| 1010 | 887 | 0.6% |
| 1011 | 890 | 0.6% |
| 1012 | 889 | 0.6% |
| Value | Count | Frequency (%) |
| 2063 | 18 | < 0.1% |
| 2062 | 34 | < 0.1% |
| 2061 | 34 | < 0.1% |
| 2060 | 34 | < 0.1% |
| 2059 | 91 | |
| 2057 | 91 | |
| 2053 | 36 | < 0.1% |
| 2052 | 70 | |
| 2051 | 18 | < 0.1% |
| 2050 | 18 | < 0.1% |
amount
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 5984 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 35851 |
| Missing (%) | 23.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.309247 |
| Minimum | 0 |
|---|---|
| Maximum | 40700 |
| Zeros | 25966 |
| Zeros (%) | 17.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.002 |
| median | 0.22 |
| Q3 | 6.31 |
| 95-th percentile | 318 |
| Maximum | 40700 |
| Range | 40700 |
| Interquartile range (IQR) | 6.308 |
Descriptive statistics
| Standard deviation | 671.96277 |
|---|---|
| Coefficient of variation (CV) | 8.691881 |
| Kurtosis | 2512.9354 |
| Mean | 77.309247 |
| Median Absolute Deviation (MAD) | 0.22 |
| Skewness | 45.275383 |
| Sum | 8821371.6 |
| Variance | 451533.97 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25966 | 17.3% |
| 0.001 | 1917 | 1.3% |
| 0.002 | 1754 | 1.2% |
| 0.003 | 1456 | 1.0% |
| 0.004 | 1022 | 0.7% |
| 0.005 | 912 | 0.6% |
| 0.006 | 717 | 0.5% |
| 0.007 | 710 | 0.5% |
| 0.01 | 689 | 0.5% |
| 0.008 | 628 | 0.4% |
| Other values (5974) | 78334 | |
| (Missing) | 35851 |
| Value | Count | Frequency (%) |
| 0 | 25966 | |
| 3.75 × 10-5 | 1 | < 0.1% |
| 6.25 × 10-5 | 1 | < 0.1% |
| 0.000125 | 11 | < 0.1% |
| 0.00025 | 2 | < 0.1% |
| 0.0003 | 2 | < 0.1% |
| 0.000375 | 3 | < 0.1% |
| 0.0004375 | 2 | < 0.1% |
| 0.0005 | 7 | < 0.1% |
| 0.0005625 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 40700 | 1 | |
| 40300 | 1 | |
| 40100 | 1 | |
| 39600 | 1 | |
| 39500 | 1 | |
| 39400 | 1 | |
| 39100 | 1 | |
| 39000 | 2 | |
| 38900 | 1 | |
| 38700 | 2 |
data_points
Real number (ℝ)
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 38324 |
| Missing (%) | 25.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.030493 |
| Minimum | 1 |
|---|---|
| Maximum | 252 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 8 |
| Maximum | 252 |
| Range | 251 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.2649315 |
|---|---|
| Coefficient of variation (CV) | 3.577915 |
| Kurtosis | 547.15659 |
| Mean | 2.030493 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.947138 |
| Sum | 226668 |
| Variance | 52.77923 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 100988 | |
| 8 | 2347 | 1.6% |
| 6 | 1685 | 1.1% |
| 12 | 1143 | 0.8% |
| 2 | 1127 | 0.8% |
| 4 | 810 | 0.5% |
| 3 | 597 | 0.4% |
| 5 | 520 | 0.3% |
| 11 | 442 | 0.3% |
| 24 | 204 | 0.1% |
| Other values (41) | 1769 | 1.2% |
| (Missing) | 38324 | 25.6% |
| Value | Count | Frequency (%) |
| 1 | 100988 | |
| 2 | 1127 | 0.8% |
| 3 | 597 | 0.4% |
| 4 | 810 | 0.5% |
| 5 | 520 | 0.3% |
| 6 | 1685 | 1.1% |
| 7 | 177 | 0.1% |
| 8 | 2347 | 1.6% |
| 9 | 154 | 0.1% |
| 10 | 47 | < 0.1% |
| Value | Count | Frequency (%) |
| 252 | 34 | |
| 250 | 2 | < 0.1% |
| 136 | 34 | |
| 135 | 2 | < 0.1% |
| 126 | 34 | |
| 125 | 2 | < 0.1% |
| 112 | 36 | |
| 78 | 34 | |
| 77 | 2 | < 0.1% |
| 73 | 65 |
derivation_id
Categorical
IMBALANCE  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 35860 |
| Missing (%) | 23.9% |
| Memory size | 2.3 MiB |
| 1.0 | |
|---|---|
| 4.0 | 1686 |
| 49.0 | 787 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0068977 |
| Min length | 3 |
Characters and Unicode
| Total characters | 343075 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 111623 | |
| 4.0 | 1686 | 1.1% |
| 49.0 | 787 | 0.5% |
| (Missing) | 35860 | 23.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 111623 | |
| 4.0 | 1686 | 1.5% |
| 49.0 | 787 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 114096 | |
| 0 | 114096 | |
| 1 | 111623 | |
| 4 | 2473 | 0.7% |
| 9 | 787 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 228979 | |
| Other Punctuation | 114096 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 114096 | |
| 1 | 111623 | |
| 4 | 2473 | 1.1% |
| 9 | 787 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 114096 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 343075 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 114096 | |
| 0 | 114096 | |
| 1 | 111623 | |
| 4 | 2473 | 0.7% |
| 9 | 787 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 343075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 114096 | |
| 0 | 114096 | |
| 1 | 111623 | |
| 4 | 2473 | 0.7% |
| 9 | 787 | 0.2% |
min
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 1776 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 139150 |
| Missing (%) | 92.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.144909 |
| Minimum | 0 |
|---|---|
| Maximum | 37200 |
| Zeros | 3430 |
| Zeros (%) | 2.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.06 |
| Q3 | 1.91 |
| 95-th percentile | 122 |
| Maximum | 37200 |
| Range | 37200 |
| Interquartile range (IQR) | 1.91 |
Descriptive statistics
| Standard deviation | 552.77724 |
|---|---|
| Coefficient of variation (CV) | 13.434888 |
| Kurtosis | 3790.744 |
| Mean | 41.144909 |
| Median Absolute Deviation (MAD) | 0.06 |
| Skewness | 57.38916 |
| Sum | 444611.88 |
| Variance | 305562.68 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3430 | 2.3% |
| 0.002 | 157 | 0.1% |
| 0.001 | 136 | 0.1% |
| 0.003 | 102 | 0.1% |
| 0.007 | 79 | 0.1% |
| 0.01 | 77 | 0.1% |
| 0.004 | 66 | < 0.1% |
| 0.03 | 65 | < 0.1% |
| 0.008 | 63 | < 0.1% |
| 0.006 | 62 | < 0.1% |
| Other values (1766) | 6569 | 4.4% |
| (Missing) | 139150 |
| Value | Count | Frequency (%) |
| 0 | 3430 | |
| 0.001 | 136 | 0.1% |
| 0.0015 | 2 | < 0.1% |
| 0.002 | 157 | 0.1% |
| 0.0025 | 1 | < 0.1% |
| 0.003 | 102 | 0.1% |
| 0.0035 | 2 | < 0.1% |
| 0.004 | 66 | < 0.1% |
| 0.005 | 61 | < 0.1% |
| 0.006 | 62 | < 0.1% |
| Value | Count | Frequency (%) |
| 37200 | 2 | |
| 8930 | 1 | |
| 5320 | 2 | |
| 5150 | 2 | |
| 5020 | 2 | |
| 4460 | 1 | |
| 4070 | 1 | |
| 3860 | 1 | |
| 3410 | 2 | |
| 3030 | 2 |
max
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 2019 |
|---|---|
| Distinct (%) | 18.7% |
| Missing | 139150 |
| Missing (%) | 92.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.185261 |
| Minimum | 0 |
|---|---|
| Maximum | 40700 |
| Zeros | 2422 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.002 |
| median | 0.176 |
| Q3 | 3.7 |
| 95-th percentile | 208.75 |
| Maximum | 40700 |
| Range | 40700 |
| Interquartile range (IQR) | 3.698 |
Descriptive statistics
| Standard deviation | 671.24011 |
|---|---|
| Coefficient of variation (CV) | 10.457854 |
| Kurtosis | 2530.3829 |
| Mean | 64.185261 |
| Median Absolute Deviation (MAD) | 0.176 |
| Skewness | 44.337477 |
| Sum | 693585.93 |
| Variance | 450563.28 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2422 | 1.6% |
| 0.002 | 165 | 0.1% |
| 0.003 | 148 | 0.1% |
| 0.001 | 141 | 0.1% |
| 0.006 | 99 | 0.1% |
| 0.004 | 97 | 0.1% |
| 0.01 | 84 | 0.1% |
| 0.005 | 80 | 0.1% |
| 0.007 | 73 | < 0.1% |
| 0.008 | 60 | < 0.1% |
| Other values (2009) | 7437 | 5.0% |
| (Missing) | 139150 |
| Value | Count | Frequency (%) |
| 0 | 2422 | |
| 0.0003 | 1 | < 0.1% |
| 0.0005 | 1 | < 0.1% |
| 0.001 | 141 | 0.1% |
| 0.002 | 165 | 0.1% |
| 0.003 | 148 | 0.1% |
| 0.004 | 97 | 0.1% |
| 0.005 | 80 | 0.1% |
| 0.0055 | 1 | < 0.1% |
| 0.006 | 99 | 0.1% |
| Value | Count | Frequency (%) |
| 40700 | 2 | |
| 14600 | 1 | |
| 9630 | 2 | |
| 8750 | 2 | |
| 8560 | 1 | |
| 7430 | 2 | |
| 7110 | 2 | |
| 7060 | 2 | |
| 6600 | 1 | |
| 5040 | 2 |
median
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 2146 |
|---|---|
| Distinct (%) | 18.8% |
| Missing | 138548 |
| Missing (%) | 92.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.110512 |
| Minimum | 0 |
|---|---|
| Maximum | 38500 |
| Zeros | 2967 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.13 |
| Q3 | 3 |
| 95-th percentile | 158 |
| Maximum | 38500 |
| Range | 38500 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 585.24524 |
|---|---|
| Coefficient of variation (CV) | 11.916904 |
| Kurtosis | 3299.6661 |
| Mean | 49.110512 |
| Median Absolute Deviation (MAD) | 0.13 |
| Skewness | 52.096074 |
| Sum | 560252.72 |
| Variance | 342512 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2967 | 2.0% |
| 0.002 | 182 | 0.1% |
| 0.001 | 135 | 0.1% |
| 0.003 | 125 | 0.1% |
| 0.004 | 97 | 0.1% |
| 0.005 | 91 | 0.1% |
| 0.01 | 87 | 0.1% |
| 0.02 | 68 | < 0.1% |
| 0.07 | 62 | < 0.1% |
| 0.006 | 61 | < 0.1% |
| Other values (2136) | 7533 | 5.0% |
| (Missing) | 138548 |
| Value | Count | Frequency (%) |
| 0 | 2967 | |
| 0.00025 | 1 | < 0.1% |
| 0.0005 | 1 | < 0.1% |
| 0.00075 | 2 | < 0.1% |
| 0.001 | 135 | 0.1% |
| 0.001373 | 1 | < 0.1% |
| 0.0015 | 1 | < 0.1% |
| 0.00185 | 1 | < 0.1% |
| 0.002 | 182 | 0.1% |
| 0.00225 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 38500 | 2 | |
| 12800 | 1 | |
| 7860 | 2 | |
| 6550 | 2 | |
| 5800 | 2 | |
| 5760 | 1 | |
| 5560 | 2 | |
| 5040 | 2 | |
| 4280 | 1 | |
| 4210 | 1 |
footnote
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 149953 |
| Missing (%) | > 99.9% |
| Memory size | 2.3 MiB |
| Trace amount |
|---|
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 36 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Trace amount |
|---|---|
| 2nd row | Trace amount |
| 3rd row | Trace amount |
Common Values
| Value | Count | Frequency (%) |
| Trace amount | 3 | < 0.1% |
| (Missing) | 149953 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trace | 3 | |
| amount | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| T | 3 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| 3 | ||
| m | 3 | |
| o | 3 | |
| u | 3 | |
| n | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30 | |
| Uppercase Letter | 3 | 8.3% |
| Space Separator | 3 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| m | 3 | |
| o | 3 | |
| u | 3 | |
| n | 3 | |
| t | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33 | |
| Common | 3 | 8.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| T | 3 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| m | 3 | |
| o | 3 | |
| u | 3 | |
| n | 3 | |
| t | 3 |
Common
| Value | Count | Frequency (%) |
| 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| T | 3 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| 3 | ||
| m | 3 | |
| o | 3 | |
| u | 3 | |
| n | 3 |
min_year_acqured
Real number (ℝ)
MISSING 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 124305 |
| Missing (%) | 82.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2014.9956 |
| Minimum | 1999 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1999 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2016 |
| median | 2016 |
| Q3 | 2016 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 22 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.1825708 |
|---|---|
| Coefficient of variation (CV) | 0.0020757221 |
| Kurtosis | 5.29866 |
| Mean | 2014.9956 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.2196443 |
| Sum | 51686652 |
| Variance | 17.493899 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2016 | 16288 | 10.9% |
| 2020 | 1469 | 1.0% |
| 2018 | 1179 | 0.8% |
| 2009 | 870 | 0.6% |
| 2011 | 838 | 0.6% |
| 2013 | 739 | 0.5% |
| 2001 | 718 | 0.5% |
| 2015 | 625 | 0.4% |
| 2019 | 597 | 0.4% |
| 2021 | 538 | 0.4% |
| Other values (9) | 1790 | 1.2% |
| (Missing) | 124305 |
| Value | Count | Frequency (%) |
| 1999 | 162 | 0.1% |
| 2000 | 414 | |
| 2001 | 718 | |
| 2003 | 99 | 0.1% |
| 2006 | 38 | < 0.1% |
| 2008 | 163 | 0.1% |
| 2009 | 870 | |
| 2010 | 137 | 0.1% |
| 2011 | 838 | |
| 2012 | 348 | 0.2% |
| Value | Count | Frequency (%) |
| 2021 | 538 | 0.4% |
| 2020 | 1469 | 1.0% |
| 2019 | 597 | 0.4% |
| 2018 | 1179 | 0.8% |
| 2017 | 278 | 0.2% |
| 2016 | 16288 | |
| 2015 | 625 | 0.4% |
| 2014 | 151 | 0.1% |
| 2013 | 739 | 0.5% |
| 2012 | 348 | 0.2% |
name
Text
MISSING 
| Distinct | 472 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 149483 |
| Missing (%) | 99.7% |
| Memory size | 2.3 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 40 |
| Mean length | 14.881607 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7039 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 471 ? |
|---|---|
| Unique (%) | 99.6% |
Sample
| 1st row | Energy (Atwater General Factors) |
|---|---|
| 2nd row | Energy (Atwater Specific Factors) |
| 3rd row | Solids |
| 4th row | Nitrogen |
| 5th row | Protein |
| Value | Count | Frequency (%) |
| acid | 40 | 3.9% |
| pufa | 35 | 3.5% |
| total | 33 | 3.3% |
| vitamin | 29 | 2.9% |
| c | 21 | 2.1% |
| sfa | 21 | 2.1% |
| mufa | 19 | 1.9% |
| acids | 17 | 1.7% |
| fatty | 14 | 1.4% |
| fiber | 11 | 1.1% |
| Other values (515) | 774 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 556 | 7.9% |
| 544 | 7.7% | |
| a | 472 | 6.7% |
| e | 418 | 5.9% |
| n | 410 | 5.8% |
| t | 410 | 5.8% |
| o | 398 | 5.7% |
| l | 303 | 4.3% |
| c | 282 | 4.0% |
| r | 277 | 3.9% |
| Other values (62) | 2969 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4722 | |
| Uppercase Letter | 948 | 13.5% |
| Space Separator | 544 | 7.7% |
| Decimal Number | 382 | 5.4% |
| Other Punctuation | 256 | 3.6% |
| Dash Punctuation | 93 | 1.3% |
| Open Punctuation | 41 | 0.6% |
| Close Punctuation | 41 | 0.6% |
| Math Symbol | 12 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 556 | |
| a | 472 | |
| e | 418 | |
| n | 410 | |
| t | 410 | |
| o | 398 | |
| l | 303 | 6.4% |
| c | 282 | 6.0% |
| r | 277 | 5.9% |
| s | 213 | 4.5% |
| Other values (16) | 983 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 141 | |
| A | 141 | |
| P | 77 | 8.1% |
| S | 68 | 7.2% |
| C | 66 | 7.0% |
| U | 61 | 6.4% |
| T | 60 | 6.3% |
| M | 39 | 4.1% |
| V | 35 | 3.7% |
| E | 34 | 3.6% |
| Other values (14) | 226 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 99 | |
| 2 | 81 | |
| 0 | 42 | |
| 3 | 35 | 9.2% |
| 6 | 31 | 8.1% |
| 8 | 28 | 7.3% |
| 4 | 24 | 6.3% |
| 5 | 21 | 5.5% |
| 7 | 16 | 4.2% |
| 9 | 5 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 159 | |
| : | 87 | |
| . | 6 | 2.3% |
| ' | 2 | 0.8% |
| ; | 1 | 0.4% |
| / | 1 | 0.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 11 | |
| > | 1 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 544 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 93 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5670 | |
| Common | 1369 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 556 | 9.8% |
| a | 472 | 8.3% |
| e | 418 | 7.4% |
| n | 410 | 7.2% |
| t | 410 | 7.2% |
| o | 398 | 7.0% |
| l | 303 | 5.3% |
| c | 282 | 5.0% |
| r | 277 | 4.9% |
| s | 213 | 3.8% |
| Other values (40) | 1931 |
Common
| Value | Count | Frequency (%) |
| 544 | ||
| , | 159 | 11.6% |
| 1 | 99 | 7.2% |
| - | 93 | 6.8% |
| : | 87 | 6.4% |
| 2 | 81 | 5.9% |
| 0 | 42 | 3.1% |
| ( | 41 | 3.0% |
| ) | 41 | 3.0% |
| 3 | 35 | 2.6% |
| Other values (12) | 147 | 10.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7039 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 556 | 7.9% |
| 544 | 7.7% | |
| a | 472 | 6.7% |
| e | 418 | 5.9% |
| n | 410 | 5.8% |
| t | 410 | 5.8% |
| o | 398 | 5.7% |
| l | 303 | 4.3% |
| c | 282 | 4.0% |
| r | 277 | 3.9% |
| Other values (62) | 2969 |
unit_name
Categorical
IMBALANCE  MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 149483 |
| Missing (%) | 99.7% |
| Memory size | 2.3 MiB |
| G | |
|---|---|
| MG | |
| UG | |
| KCAL | 3 |
| IU | 3 |
| Other values (7) | 10 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 1.653277 |
| Min length | 1 |
Characters and Unicode
| Total characters | 782 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | KCAL |
|---|---|
| 2nd row | KCAL |
| 3rd row | G |
| 4th row | G |
| 5th row | G |
Common Values
| Value | Count | Frequency (%) |
| G | 204 | 0.1% |
| MG | 184 | 0.1% |
| UG | 69 | < 0.1% |
| KCAL | 3 | < 0.1% |
| IU | 3 | < 0.1% |
| UMOL_TE | 3 | < 0.1% |
| MCG_RE | 2 | < 0.1% |
| PH | 1 | < 0.1% |
| SP_GR | 1 | < 0.1% |
| kJ | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
| (Missing) | 149483 |
Length
| Value | Count | Frequency (%) |
| g | 204 | |
| mg | 184 | |
| ug | 69 | 14.6% |
| kcal | 3 | 0.6% |
| iu | 3 | 0.6% |
| umol_te | 3 | 0.6% |
| mcg_re | 2 | 0.4% |
| ph | 1 | 0.2% |
| sp_gr | 1 | 0.2% |
| kj | 1 | 0.2% |
| Other values (2) | 2 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 463 | |
| M | 191 | |
| U | 75 | 9.6% |
| _ | 8 | 1.0% |
| E | 7 | 0.9% |
| L | 6 | 0.8% |
| C | 5 | 0.6% |
| A | 5 | 0.6% |
| T | 4 | 0.5% |
| I | 3 | 0.4% |
| Other values (8) | 15 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 773 | |
| Connector Punctuation | 8 | 1.0% |
| Lowercase Letter | 1 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 463 | |
| M | 191 | |
| U | 75 | 9.7% |
| E | 7 | 0.9% |
| L | 6 | 0.8% |
| C | 5 | 0.6% |
| A | 5 | 0.6% |
| T | 4 | 0.5% |
| I | 3 | 0.4% |
| O | 3 | 0.4% |
| Other values (6) | 11 | 1.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Lowercase Letter
| Value | Count | Frequency (%) |
| k | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 774 | |
| Common | 8 | 1.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 463 | |
| M | 191 | |
| U | 75 | 9.7% |
| E | 7 | 0.9% |
| L | 6 | 0.8% |
| C | 5 | 0.6% |
| A | 5 | 0.6% |
| T | 4 | 0.5% |
| I | 3 | 0.4% |
| O | 3 | 0.4% |
| Other values (7) | 12 | 1.6% |
Common
| Value | Count | Frequency (%) |
| _ | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 782 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 463 | |
| M | 191 | |
| U | 75 | 9.6% |
| _ | 8 | 1.0% |
| E | 7 | 0.9% |
| L | 6 | 0.8% |
| C | 5 | 0.6% |
| A | 5 | 0.6% |
| T | 4 | 0.5% |
| I | 3 | 0.4% |
| Other values (8) | 15 | 1.9% |
nutrient_nbr
Real number (ℝ)
MISSING 
| Distinct | 461 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 149495 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 516.19217 |
| Minimum | 200 |
|---|---|
| Maximum | 958 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 200 |
|---|---|
| 5-th percentile | 225 |
| Q1 | 321.2 |
| median | 510 |
| Q3 | 691 |
| 95-th percentile | 837 |
| Maximum | 958 |
| Range | 758 |
| Interquartile range (IQR) | 369.8 |
Descriptive statistics
| Standard deviation | 208.32844 |
|---|---|
| Coefficient of variation (CV) | 0.40358698 |
| Kurtosis | -1.248768 |
| Mean | 516.19217 |
| Median Absolute Deviation (MAD) | 186 |
| Skewness | 0.15583541 |
| Sum | 237964.59 |
| Variance | 43400.737 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 675 | 1 | < 0.1% |
| 673 | 1 | < 0.1% |
| 672 | 1 | < 0.1% |
| 671 | 1 | < 0.1% |
| 670 | 1 | < 0.1% |
| 669 | 1 | < 0.1% |
| 668 | 1 | < 0.1% |
| 667 | 1 | < 0.1% |
| 666 | 1 | < 0.1% |
| 665 | 1 | < 0.1% |
| Other values (451) | 451 | 0.3% |
| (Missing) | 149495 |
| Value | Count | Frequency (%) |
| 200 | 1 | |
| 201 | 1 | |
| 202 | 1 | |
| 203 | 1 | |
| 204 | 1 | |
| 205 | 1 | |
| 205.2 | 1 | |
| 206 | 1 | |
| 207 | 1 | |
| 208 | 1 |
| Value | Count | Frequency (%) |
| 958 | 1 | |
| 957 | 1 | |
| 956 | 1 | |
| 955 | 1 | |
| 954 | 1 | |
| 953 | 1 | |
| 952 | 1 | |
| 951 | 1 | |
| 950 | 1 | |
| 861 | 1 |
rank
Real number (ℝ)
MISSING 
| Distinct | 352 |
|---|---|
| Distinct (%) | 76.2% |
| Missing | 149494 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 235701.64 |
| Minimum | 50 |
|---|---|
| Maximum | 999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 1326.05 |
| Q1 | 7252.5 |
| median | 14550 |
| Q3 | 22075 |
| 95-th percentile | 999999 |
| Maximum | 999999 |
| Range | 999949 |
| Interquartile range (IQR) | 14822.5 |
Descriptive statistics
| Standard deviation | 414985 |
|---|---|
| Coefficient of variation (CV) | 1.7606368 |
| Kurtosis | -0.29677632 |
| Mean | 235701.64 |
| Median Absolute Deviation (MAD) | 7325 |
| Skewness | 1.3051822 |
| Sum | 1.0889416 × 108 |
| Variance | 1.7221255 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 999999 | 105 | 0.1% |
| 16211 | 2 | < 0.1% |
| 15100 | 2 | < 0.1% |
| 7500 | 2 | < 0.1% |
| 8730 | 2 | < 0.1% |
| 6250 | 2 | < 0.1% |
| 14250 | 2 | < 0.1% |
| 15601 | 1 | < 0.1% |
| 19200 | 1 | < 0.1% |
| 19100 | 1 | < 0.1% |
| Other values (342) | 342 | 0.2% |
| (Missing) | 149494 |
| Value | Count | Frequency (%) |
| 50 | 1 | |
| 100 | 1 | |
| 200 | 1 | |
| 280 | 1 | |
| 290 | 1 | |
| 300 | 1 | |
| 400 | 1 | |
| 500 | 1 | |
| 600 | 1 | |
| 700 | 1 |
| Value | Count | Frequency (%) |
| 999999 | 105 | |
| 23100 | 1 | < 0.1% |
| 23000 | 1 | < 0.1% |
| 22900 | 1 | < 0.1% |
| 22800 | 1 | < 0.1% |
| 22700 | 1 | < 0.1% |
| 22600 | 1 | < 0.1% |
| 22500 | 1 | < 0.1% |
| 22400 | 1 | < 0.1% |
| 22300 | 1 | < 0.1% |
| fdc_id | data_type | description | food_category_id | publication_date | id | nutrient_id | amount | data_points | derivation_id | min | max | median | footnote | min_year_acqured | name | unit_name | nutrient_nbr | rank | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 319874.0 | sample_food | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 319875.0 | market_acquisition | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 319876.0 | market_acquisition | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 319877.0 | sub_sample_food | Hummus | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 319878.0 | sub_sample_food | Hummus | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 319879.0 | sample_food | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 319880.0 | market_acquisition | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 319881.0 | market_acquisition | HUMMUS, SABRA CLASSIC | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 319882.0 | sub_sample_food | Hummus | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 319883.0 | sub_sample_food | Hummus | 16.0 | 2019-04-01 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| fdc_id | data_type | description | food_category_id | publication_date | id | nutrient_id | amount | data_points | derivation_id | min | max | median | footnote | min_year_acqured | name | unit_name | nutrient_nbr | rank | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 463 | NaN | NaN | NaN | NaN | NaN | 2050.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Genistin | MG | 718.0 | 19320.0 |
| 464 | NaN | NaN | NaN | NaN | NaN | 2051.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Glycitin | MG | 719.0 | 19330.0 |
| 465 | NaN | NaN | NaN | NaN | NaN | 2057.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Ergothionine | MG | NaN | 16255.0 |
| 466 | NaN | NaN | NaN | NaN | NaN | 2058.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Beta-glucan | G | NaN | 1327.0 |
| 467 | NaN | NaN | NaN | NaN | NaN | 2059.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Vitamin D4 | UG | NaN | 8730.0 |
| 468 | NaN | NaN | NaN | NaN | NaN | 2060.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Ergosta-7-enol | MG | NaN | 16210.0 |
| 469 | NaN | NaN | NaN | NaN | NaN | 2061.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Ergosta-7,22-dienol | MG | NaN | 16211.0 |
| 470 | NaN | NaN | NaN | NaN | NaN | 2062.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Ergosta-5,7-dienol | MG | NaN | 16211.0 |
| 471 | NaN | NaN | NaN | NaN | NaN | 2063.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Verbascose | G | NaN | 2450.0 |
| 472 | NaN | NaN | NaN | NaN | NaN | 2064.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Oligiosaccharides | MG | NaN | 2250.0 |